Search for: All records

Creators/Authors contains: "Metaxas, Dimitris N."

« Prev Next »

Total Resources

19

Resource Type
Conference Paper

10

Conference Proceeding

0

Dataset

0

Journal Article

9

Workshop Report

0

Availability
Full Text / Resource Available

18

Citation Only

1

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization

Xia, Zhaoyang ; Neidle, Carol ; Metaxas, Dimitris N. ( November 2023 , arXiv)

Since American Sign Language (ASL) has no standard written form, Deaf signers frequently share videos in order to communicate in their native language. However, since both hands and face convey critical linguistic information in signed languages, sign language videos cannot preserve signer privacy. While signers have expressed interest, for a variety of applications, in sign language video anonymization that would effectively preserve linguistic content, attempts to develop such technology have had limited success, given the complexity of hand movements and facial expressions. Existing approaches rely predominantly on precise pose estimations of the signer in video footage and often require sign language video datasets for training. These requirements prevent them from processing videos 'in the wild,' in part because of the limited diversity present in current sign language video datasets. To address these limitations, our research introduces DiffSLVA, a novel methodology that utilizes pre-trained large-scale diffusion models for zero-shot text-guided sign language video anonymization. We incorporate ControlNet, which leverages low-level image features such as HED (Holistically-Nested Edge Detection) edges, to circumvent the need for pose estimation. Additionally, we develop a specialized module dedicated to capturing facial expressions, which are critical for conveying essential linguistic information in signed languages. We then combine the above methods to achieve anonymization that better preserves the essential linguistic content of the original signer. This innovative methodology makes possible, for the first time, sign language video anonymization that could be used for real-world applications, which would offer significant benefits to the Deaf and Hard-of-Hearing communities. We demonstrate the effectiveness of our approach with a series of signer anonymization experiments.
more » « less
Free, publicly-accessible full text available November 27, 2024
Mining multi-center heterogeneous medical data with distributed synthetic learning

https://doi.org/10.1038/s41467-023-40687-y

Chang, Qi ; Yan, Zhennan ; Zhou, Mu ; Qu, Hui ; He, Xiaoxiao ; Zhang, Han ; Baskaran, Lohendran ; Al’Aref, Subhi ; Li, Hongsheng ; Zhang, Shaoting ; et al ( September 2023 , Nature Communications)

Abstract
Overcoming barriers on the use of multi-center data for medical analytics is challenging due to privacy protection and data heterogeneity in the healthcare system. In this study, we propose the Distributed Synthetic Learning (DSL) architecture to learn across multiple medical centers and ensure the protection of sensitive personal information. DSL enables the building of a homogeneous dataset with entirely synthetic medical images via a form of GAN-based synthetic learning. The proposed DSL architecture has the following key functionalities: multi-modality learning, missing modality completion learning, and continual learning. We systematically evaluate the performance of DSL on different medical applications using cardiac computed tomography angiography (CTA), brain tumor MRI, and histopathology nuclei datasets. Extensive experiments demonstrate the superior performance of DSL as a high-quality synthetic medical image provider by the use of an ideal synthetic quality metric called Dist-FID. We show that DSL can be adapted to heterogeneous data and remarkably outperforms the real misaligned modalities segmentation model by 55% and the temporal datasets segmentation model by 8%.

more » « less
A Manifold View of Adversarial Risk

Zhang, Wenjia ; Zhang, Yikai ; Hu, Xiaoling ; Goswami, Mayank ; Chen, Chao ; Metaxas, Dimitris N. ( January 2023 , Proceedings of Machine Learning Research)

The adversarial risk of a machine learning model has been widely studied. Most previous works assume that the data lies in the whole ambient space. We propose to take a new angle and take the manifold assumption into consideration. Assuming data lies in a manifold, we investigate two new types of adversarial risk, the normal adversarial risk due to perturbation along normal direction, and the in-manifold adversarial risk due to perturbation within the manifold. We prove that the classic adversarial risk can be bounded from both sides using the normal and in-manifold adversarial risks. We also show with a surprisingly pessimistic case that the standard adversarial risk can be nonzero even when both normal and in-manifold risks are zero. We finalize the paper with empirical studies supporting our theoretical results. Our results suggest the possibility of improving the robustness of a classifier by only focusing on the normal adversarial risk.
more » « less
Full Text Available
A Manifold View of Adversarial Risk

Zhang, Wenjia ; Zhang, Yikai ; Hu, Xiaoling ; Goswami, Mayank ; Chen, Chao ; Metaxas, Dimitris N. ( April 2022 , Proceedings of The 25th International Conference on Artificial Intelligence and Statistics)

The adversarial risk of a machine learning model has been widely studied. Most previous works assume that the data lies in the whole ambient space. We propose to take a new angle and take the manifold assumption into consideration. Assuming data lies in a manifold, we investigate two new types of adversarial risk, the normal adversarial risk due to perturbation along normal direction, and the in-manifold adversarial risk due to perturbation within the manifold. We prove that the classic adversarial risk can be bounded from both sides using the normal and in-manifold adversarial risks. We also show with a surprisingly pessimistic case that the standard adversarial risk can be nonzero even when both normal and in-manifold risks are zero. We finalize the paper with empirical studies supporting our theoretical results. Our results suggest the possibility of improving the robustness of a classifier by only focusing on the normal adversarial risk.
more » « less
Full Text Available
Genetic mutation and biological pathway prediction based on whole slide images in breast carcinoma using deep learning

https://doi.org/10.1038/s41698-021-00225-9

Qu, Hui ; Zhou, Mu ; Yan, Zhennan ; Wang, He ; Rustgi, Vinod K. ; Zhang, Shaoting ; Gevaert, Olivier ; Metaxas, Dimitris N. ( September 2021 , npj Precision Oncology)

Abstract
Breast carcinoma is the most common cancer among women worldwide that consists of a heterogeneous group of subtype diseases. The whole-slide images (WSIs) can capture the cell-level heterogeneity, and are routinely used for cancer diagnosis by pathologists. However, key driver genetic mutations related to targeted therapies are identified by genomic analysis like high-throughput molecular profiling. In this study, we develop a deep-learning model to predict the genetic mutations and biological pathway activities directly from WSIs. Our study offers unique insights into WSI visual interactions between mutation and its related pathway, enabling a head-to-head comparison to reinforce our major findings. Using the histopathology images from the Genomic Data Commons Database, our model can predict the point mutations of six important genes (AUC 0.68–0.85) and copy number alteration of another six genes (AUC 0.69–0.79). Additionally, the trained models can predict the activities of three out of ten canonical pathways (AUC 0.65–0.79). Next, we visualized the weight maps of tumor tiles in WSI to understand the decision-making process of deep-learning models via a self-attention mechanism. We further validated our models on liver and lung cancers that are related to metastatic breast cancer. Our results provide insights into the association between pathological image features, molecular outcomes, and targeted therapies for breast cancer patients.

more » « less
Error-Bounded Correction of Noisy Labels

Zheng, Songzhu ; Wu, Pengxiang ; Goswami, Aman ; Goswami, Mayank ; Metaxas, Dimitris N. ; Chen, Chao ( December 2020 , Proceedings of Machine Learning Research)
null (Ed.)
Full Text Available
Towards Image-to-Video Translation: A Structure-Aware Approach via Multi-stage Generative Adversarial Networks

https://doi.org/10.1007/s11263-020-01328-9

Zhao, Long ; Peng, Xi ; Tian, Yu ; Kapadia, Mubbasir ; Metaxas, Dimitris N. ( November 2020 , International Journal of Computer Vision)
null (Ed.)
Full Text Available
Knowledge As Priors: Cross-Modal Knowledge Generalization for Datasets Without Superior Knowledge

https://doi.org/10.1109/CVPR42600.2020.00656

Zhao, Long ; Peng, Xi ; Chen, Yuxiao ; Kapadia, Mubbasir ; Metaxas, Dimitris N. ( June 2020 , Computer Vision and Pattern Recognition)
null (Ed.)
Full Text Available
Knowledge as Priors: Cross-Modal Knowledge Generalization for Datasets without Superior Knowledge

Zhao, Long ; Peng, Xi ; Chen, Yuxiao ; Kapadia, Mubbasir ; Metaxas, Dimitris N. ( April 2020 , Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data

https://doi.org/10.1109/CVPR42600.2020.01387

Chang, Qi ; Qu, Hui ; Zhang, Yikai ; Sabuncu, Mert ; Chen, Chao ; Zhang, Tong ; Metaxas, Dimitris N. ( June 2020 , IEEE Conference on Computer Vision and Pattern Recognition (CVPR))
null (Ed.)
Full Text Available

« Prev Next »